Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 2 de 2
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Molecules ; 26(9)2021 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-33923273

RESUMO

Many gram-negative bacteria use type IV secretion systems to deliver effector molecules to a wide range of target cells. These substrate proteins, which are called type IV secreted effectors (T4SE), manipulate host cell processes during infection, often resulting in severe diseases or even death of the host. Therefore, identification of putative T4SEs has become a very active research topic in bioinformatics due to its vital roles in understanding host-pathogen interactions. PSI-BLAST profiles have been experimentally validated to provide important and discriminatory evolutionary information for various protein classification tasks. In the present study, an accurate computational predictor termed iT4SE-EP was developed for identifying T4SEs by extracting evolutionary features from the position-specific scoring matrix and the position-specific frequency matrix profiles. First, four types of encoding strategies were designed to transform protein sequences into fixed-length feature vectors based on the two profiles. Then, the feature selection technique based on the random forest algorithm was utilized to reduce redundant or irrelevant features without much loss of information. Finally, the optimal features were input into a support vector machine classifier to carry out the prediction of T4SEs. Our experimental results demonstrated that iT4SE-EP outperformed most of existing methods based on the independent dataset test.


Assuntos
Evolução Molecular , Bactérias Gram-Negativas/genética , Interações Hospedeiro-Patógeno/genética , Sistemas de Secreção Tipo IV/genética , Sequência de Aminoácidos/genética , Infecções Bacterianas/tratamento farmacológico , Infecções Bacterianas/genética , Infecções Bacterianas/microbiologia , Biologia Computacional , Bactérias Gram-Negativas/patogenicidade , Humanos , Sistemas de Secreção Tipo IV/química
2.
Comput Math Methods Med ; 2020: 1384749, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32300371

RESUMO

Prediction of DNA-binding proteins (DBPs) has become a popular research topic in protein science due to its crucial role in all aspects of biological activities. Even though considerable efforts have been devoted to developing powerful computational methods to solve this problem, it is still a challenging task in the field of bioinformatics. A hidden Markov model (HMM) profile has been proved to provide important clues for improving the prediction performance of DBPs. In this paper, we propose a method, called HMMPred, which extracts the features of amino acid composition and auto- and cross-covariance transformation from the HMM profiles, to help train a machine learning model for identification of DBPs. Then, a feature selection technique is performed based on the extreme gradient boosting (XGBoost) algorithm. Finally, the selected optimal features are fed into a support vector machine (SVM) classifier to predict DBPs. The experimental results tested on two benchmark datasets show that the proposed method is superior to most of the existing methods and could serve as an alternative tool to identify DBPs.


Assuntos
Algoritmos , Proteínas de Ligação a DNA/química , Aprendizado de Máquina , Sequência de Aminoácidos , Aminoácidos/análise , Biologia Computacional , Proteínas de Ligação a DNA/genética , Bases de Dados de Proteínas/estatística & dados numéricos , Humanos , Cadeias de Markov , Curva ROC , Máquina de Vetores de Suporte
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...